Extraction of Target User Group from Web Usage Data Using Evolutionary Biclustering Approach

Authors

  • R. Rathipriya
  • K. Thangavel
  • J. Bagyamani
Abstract

Data mining extracts hidden information from a database that the user did not know existed. Biclustering is a data mining technique that helps marketers target campaigns more accurately and align them more closely with the needs, wants, and attitudes of customers and prospects. The biclustering results can be tuned to find users' browsing patterns relevant to current business problems. This paper presents a new application of biclustering to web usage data using a combination of heuristic and metaheuristic algorithms. Two-way K-means clustering is used to generate seeds from preprocessed web usage data. A greedy heuristic is then applied iteratively to refine the set of seeds; it is fast but often yields only locally optimal solutions. In this paper, a Genetic Algorithm is used as a global optimizer that can be coupled with the greedy method to identify globally optimal target user groups based on their coherent browsing patterns. The performance of the proposed work is evaluated by experiments on msnbc, a clickstream dataset from the UCI repository. Results show that the proposed work performs well in extracting optimal target user groups from web usage data, which can be used for focused marketing campaigns.

DOI: 10.4018/jamc.2011070104. International Journal of Applied Metaheuristic Computing, 2(3), 69-79, July-September 2011. Copyright © 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.

... al., 2004). User segmentation is the process of developing effective schemes for categorizing and organizing meaningful groups of customers. A user segment is a group of prospects or customers who are selected from a database based on characteristics they possess or exhibit. Segmentation also allows a company to treat consumers in different segments differently. User profiling is the process of analyzing the customers of each segment in order to generalize, describe, or name this set of customers based on common characteristics. It is the process of understanding and labeling a set of users. It provides valuable information about users and customers so that marketers can make stronger, more targeted offers; each user segment is then a target group of users.

One-to-one marketing is the ideal marketing strategy, in which every marketing campaign or product is optimally targeted at each individual customer, but this is not always possible. Therefore, segmentation is required to identify similar users and put them together in a segment or group. Using segmentation to understand users' needs is much easier, faster, and more economical than investing in understanding each user individually (Jonker et al., 2004). With proper market segmentation, enterprises can arrange the right web pages, services, and resources for each target user group and build a close relationship with them. Market segmentation has consequently been regarded as one of the most critical elements in achieving successful modern marketing and customer relationship management.

Clickstream data is a sequence of Uniform Resource Locators (URLs) browsed by a user within a particular period of time. Groups of users with similar behavior and motivation for visiting a particular website can be discovered by clustering. Traditional clustering (Lee et al., 2008) is used to segment web users or web pages into groups based on existing similarities. When a clustering method is used for grouping users, it typically partitions users according to the similarity of their browsing behavior across all pages. In most cases, however, web users behave similarly only on a subset of pages, and their behavior is not similar over the rest of the pages. Therefore, traditional clustering methods fail to identify such user groups.
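The limitation described above can be illustrated with a small numeric sketch. The hit counts, the page subset, and the use of Pearson correlation as the similarity measure are all assumptions made for illustration; none of them comes from the paper. Two hypothetical users look dissimilar when compared over all pages, yet they are perfectly correlated on a subset of pages.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


# Hypothetical hit counts of two users over six page categories.
user_a = [1, 2, 3, 9, 0, 7]
user_b = [2, 4, 6, 0, 8, 1]

# Hypothetical subset of page categories on which the users agree.
subset = [0, 1, 2]

# Similarity over all pages, as a one-way clustering method would see it.
corr_all = pearson(user_a, user_b)

# Similarity restricted to the page subset, as a bicluster would see it.
corr_subset = pearson([user_a[j] for j in subset],
                      [user_b[j] for j in subset])

print(f"all pages: {corr_all:.3f}, subset: {corr_subset:.3f}")
```

Over all six categories the correlation is negative, so a method that compares complete rows would keep these users apart, while on the first three categories the two rows are proportional and the correlation is 1 up to rounding.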
To overcome this problem, biclustering is used. Biclustering (Bleuler et al., 2004; Chakraborty et al., 2005; Madeira et al., 2004) was first introduced by Hartigan (1972); it is the simultaneous clustering of the rows and columns of a data matrix. In the literature, biclustering algorithms are widely applied to gene expression data. In this paper, biclustering is used to mine clickstream data in order to extract target user groups. These groups are analyzed to determine users' behavior, which is an important element in e-commerce applications.

The rest of the paper is organized as follows. Section 2 discusses existing work related to biclustering approaches and user segmentation. Section 3 describes the methods and materials required for the biclustering approach. Section 4 focuses on the proposed biclustering approach using a Genetic Algorithm. Section 5 analyzes the experimental results. Section 6 concludes the paper.
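A bicluster, seen as a submatrix of users (rows) and pages (columns), and the greedy refinement of a seed can be sketched as follows. This is only a sketch under stated assumptions: the excerpt names neither the coherence measure nor the greedy rule, so the mean squared residue of Cheng and Church stands in as the coherence score, and `greedy_grow`, the toy matrix, and the threshold are hypothetical. The Genetic Algorithm stage is not shown.

```python
def mean_squared_residue(matrix, rows, cols):
    """Mean squared residue of the submatrix rows x cols (0 = perfectly additive)."""
    sub = [[matrix[i][j] for j in cols] for i in rows]
    n_r, n_c = len(rows), len(cols)
    row_mean = [sum(r) / n_c for r in sub]
    col_mean = [sum(sub[i][j] for i in range(n_r)) / n_r for j in range(n_c)]
    all_mean = sum(row_mean) / n_r
    total = 0.0
    for i in range(n_r):
        for j in range(n_c):
            residue = sub[i][j] - row_mean[i] - col_mean[j] + all_mean
            total += residue ** 2
    return total / (n_r * n_c)


def greedy_grow(matrix, rows, cols, threshold):
    """Hypothetical greedy step: add rows/columns while the residue stays <= threshold."""
    rows, cols = list(rows), list(cols)
    improved = True
    while improved:
        improved = False
        for i in range(len(matrix)):
            if i not in rows and mean_squared_residue(matrix, rows + [i], cols) <= threshold:
                rows.append(i)
                improved = True
        for j in range(len(matrix[0])):
            if j not in cols and mean_squared_residue(matrix, rows, cols + [j]) <= threshold:
                cols.append(j)
                improved = True
    return sorted(rows), sorted(cols)


# Hypothetical user-by-page hit-count matrix (rows: users, columns: pages).
data = [
    [1, 2, 3, 9],
    [2, 3, 4, 0],
    [3, 4, 5, 7],
    [9, 0, 8, 1],
]

# Hypothetical seed, standing in for one produced by two-way K-means.
rows, cols = greedy_grow(data, rows=[0, 1], cols=[0, 1], threshold=1e-9)
print(rows, cols)
```

Starting from the two-user, two-page seed, the greedy step grows the bicluster to users 0 to 2 and pages 0 to 2, where the rows differ only by a constant shift, and it stops there because adding user 3 or page 3 would raise the residue above the threshold. A greedy pass like this depends on its seed and on the order in which candidates are tried, which is the kind of local optimum the abstract says the Genetic Algorithm is meant to escape.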

Similar resources

Extraction of Web Usage Profiles using Simulated Annealing Based Biclustering Approach

In this paper, a Simulated Annealing (SA) based biclustering approach is proposed in which SA is used as an optimization tool for biclustering web usage data to identify the optimal user profile. The extracted biclusters consist of correlated users whose usage behaviors are similar across a subset of the web pages of a website, whereas these users are uncorrel...

Data Extraction using Content-Based Handles

In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. Our extraction algorithm is inspired by the way a human user scans the page content for specific data. In particular, we use text fea...

Hybrid Swarm Intelligence-Based Biclustering Approach for Recommendation of Web Pages

This chapter focuses on recommender systems based on users' coherent browsing patterns. A biclustering approach is used to discover the aggregate usage profiles from the preprocessed Web data. A combination of Discrete Artificial Bees Colony Optimization and Simulated Annealing is used to optimize the aggregate usage profiles from the preprocessed clickstream data. Web page recom...

Usage Profile Generation from Web Usage Data Using Hybrid Biclustering Algorithm

Biclustering has the potential to make significant contributions in the fields of information retrieval, web mining, and so forth. In this paper, the authors analyze the complex association between users and pages of a web site by using a biclustering algorithm. This method automatically identifies the groups of users that show similar browsing patterns under a specific subset of the pages. In ...

Journal title:
  • Int. J. of Applied Metaheuristic Computing

Volume 2, Issue

Pages -

Publication date: 2011